NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Programmatic Imitation Learning From Unlabeled and Noisy Demonstrations

https://doi.org/10.1109/LRA.2024.3385691

Xin, Jimmy; Zheng, Linus; Rahmani, Kia; Wei, Jiayi; Holtz, Jarrett; Dillig, Isil; Biswas, Joydeep (June 2024, IEEE Robotics and Automation Letters)

Full Text Available
SocialGym: A Framework for Benchmarking Social Robot Navigation

https://doi.org/10.1109/IROS47612.2022.9982021

Holtz, Jarrett; Biswas, Joydeep (October 2022, Intelligent Robots and Systems (IROS), IEEE/RSJ International Conference on)

Full Text Available
STEADY: Simultaneous State Estimation and Dynamics Learning from Indirect Observations

https://doi.org/10.1109/IROS47612.2022.9981279

Wei, Jiayi; Holtz, Jarrett; Dillig, Isil; Biswas, Joydeep (October 2022, Intelligent Robots and Systems (IROS), IEEE/RSJ International Conference on)

Full Text Available
SOCIALGYM: A Framework for Benchmarking Social Robot Navigation

Holtz, Jarrett; Biswas, Joydeep (January 2022, Intelligent Robots and Systems (IROS), IEEE/RSJ International Conference on)

Full Text Available
Iterative Program Synthesis for Adaptable Social Navigation

Holtz, Jarrett; Andrews, Simon; Guha, Arjun; Biswas, Joydeep (October 2021, Intelligent Robots and Systems (IROS), IEEE/RSJ International Conference on)

Full Text Available
Iterative Program Synthesis for Adaptable Social Navigation

https://doi.org/10.1109/IROS51168.2021.9636540

Holtz, Jarrett; Andrews, Simon; Guha, Arjun; Biswas, Joydeep (September 2021, 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS))

Robot social navigation is influenced by human preferences and environment-specific scenarios such as elevators and doors, thus necessitating end-user adaptability. State-of-the-art approaches to social navigation fall into two categories: model-based social constraints and learning-based approaches. While effective, these approaches have fundamental limitations – model-based approaches require constraint and parameter tuning to adapt to preferences and new scenarios, while learning-based approaches require reward functions, significant training data, and are hard to adapt to new social scenarios or new domains with limited demonstrations.In this work, we propose Iterative Dimension Informed Program Synthesis (IDIPS) to address these limitations by learning and adapting social navigation in the form of human-readable symbolic programs. IDIPS works by combining pro-gram synthesis, parameter optimization, predicate repair, and iterative human demonstration to learn and adapt model-free action selection policies from orders of magnitude less data than learning-based approaches. We introduce a novel predicate repair technique that can accommodate previously unseen social scenarios or preferences by growing existing policies.We present experimental results showing that IDIPS: 1) synthesizes effective policies that model user preference, 2) can adapt existing policies to changing preferences, 3) can extend policies to handle novel social scenarios such as locked doors, and 4) generates policies that can be transferred from simulation to real-world robots with minimal effort.
more » « less
Full Text Available
Robot Action Selection Learning via Layered Dimension Informed Program Synthesis

Holtz, Jarrett; Guha, Arjun; Biswas, Joydeep (November 2020, Conference on Robot Learning (CoRL))
null (Ed.)
Action selection policies (ASPs), used to compose low-level robot skills into complex high-level tasks are commonly represented as neural networks (NNs) in the state of the art. Such a paradigm, while very effective, suffers from a few key problems: 1) NNs are opaque to the user and hence not amenable to verification, 2) they require significant amounts of training data, and 3) they are hard to repair when the domain changes. We present two key insights about ASPs for robotics. First, ASPs need to reason about physically meaningful quantities derived from the state of the world, and second, there exists a layered structure for composing these policies. Leveraging these insights, we introduce layered dimension-informed program synthesis (LDIPS) - by reasoning about the physical dimensions of state variables, and dimensional constraints on operators, LDIPS directly synthesizes ASPs in a human-interpretable domain-specific language that is amenable to program repair. We present empirical results to demonstrate that LDIPS 1) can synthesize effective ASPs for robot soccer and autonomous driving domains, 2) requires two orders of magnitude fewer training examples than a comparable NN representation, and 3) can repair the synthesized ASPs with only a small number of corrections when transferring from simulation to real robots.
more » « less
Full Text Available
Robot Action Selection Learning via Layered Dimension Informed Program Synthesis

Holtz, Jarrett; Guha, Arjun; Biswas, Joydeep (November 2020, Conference on Robot Learning)
null (Ed.)
Action selection policies (ASPs), used to compose low-level robot skills into complex high-level tasks are commonly represented as neural networks (NNs) in the state of the art. Such a paradigm, while very effective, suffers from a few key problems: 1) NNs are opaque to the user and hence not amenable to verification, 2) they require significant amounts of training data, and 3) they are hard to repair when the domain changes. We present two key insights about ASPs for robotics. First, ASPs need to reason about physically meaningful quantities derived from the state of the world, and second, there exists a layered structure for composing these policies. Leveraging these insights, we introduce layered dimension-informed program synthesis (LDIPS) – by reasoning about the physical dimensions of state variables, and dimensional constraints on operators, LDIPS directly synthesizes ASPs in a human-interpretable domain-specific language that is amenable to program repair. We present empirical results to demonstrate that LDIPS 1) can synthesize effective ASPs for robot soccer and autonomous driving domains, 2) enables tractable synthesis for robot action selection policies not possible with state of the art synthesis techniques, 3) requires two orders of magnitude fewer training examples than a comparable NN representation, and 4) can repair the synthesized ASPs with only a small number of corrections when transferring from simulation to real robots.
more » « less
Full Text Available

Search for: All records